oddball: Spotting Anomalies in Weighted Graphs

نویسندگان

  • Leman Akoglu
  • Mary McGlohon
  • Christos Faloutsos
چکیده

Given a large, weighted graph, how can we find anomalies? Which rules should be violated, before we label a node as an anomaly? We propose the OddBall algorithm, to find such nodes. The contributions are the following: (a) we discover several new rules (power laws) in density, weights, ranks and eigenvalues that seem to govern the socalled “neighborhood sub-graphs” and we show how to use these rules for anomaly detection; (b) we carefully choose features, and design OddBall, so that it is scalable and it can work un-supervised (no user-defined constants) and (c) we report experiments on many real graphs with up to 1.6 million nodes, where OddBall indeed spots unusual nodes that agree with intuition.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Anomaly Detection in Large Graphs

Discovering anomalies is an important and challenging task for many settings, from network intrusion to fraud detection. However, most work to date has focused on clouds of multi-dimensional points, with little emphasis on graph data; even then, the focus is on un-weighted, node-labeled graphs. Here we propose OddBall, an algorithm to detect anomalous nodes in weighted graphs. The contributions...

متن کامل

Quantifying target spotting performances with complex geoscientific imagery using ERP P300 responses

Geoscientific data interpretation is a challenging task, which requires the detection and synthesis of complex patterns within data. As a first step towards better understanding this interpretation process, our research focuses on quantitative monitoring of interpreters' brain responses associated with geoscientific target spotting. This paper presents a method that profiles brain responses usi...

متن کامل

A Product Graph Based Method for Dual Subgraph Matching Applied to Symbol Spotting

Product graph has been shown as a way for matching subgraphs. This paper reports the extension of the product graph methodology for subgraph matching applied to symbol spotting in graphical documents. Here we focus on the two major limitations of the previous version of the algorithm: (1) spurious nodes and edges in the graph representation and (2) inefficient node and edge attributes. To deal ...

متن کامل

The effect of estimation methods on fractal modeling for anomalies’ detection in the Irankuh area, Central Iran

This study aims to recognize effect of Ordinary Kriging (OK) and Inverse Distance Weighted (IDW) estimation methods for separation of geochemical anomalies based on soil samples using Concentration-Area (C-A) fractal model in Irankuh area, central Iran. Variograms and anisotropic ellipsoid were generated for the Pb and Zn distribution. Thresholds values from the C-A log-log plots based on the e...

متن کامل

Anomaly Detection in Time Series of Graphs using ARMA Processes

There are many situations in which indicators of changes or anomalies in communication networks can be helpful, e.g. in the identification of faults. A dynamic communication network is characterised as a series of graphs with vertices representing IP addresses and edges representing information exchange between these entities weighted by packets sent. Ten graph distance metrics are used to crea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010